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Abstract. In this paper, a novel framework for automated liver seg¬ 
mentation via a level set formulation is presented. A sparse represen¬ 
tation of both global (region-based) and local (voxel-wise) image infor¬ 
mation is embedded in a level set formulation to innovate a new cost 
function. Two dictionaries are build: A region-based feature dictionary 
and a voxel-wise dictionary. These dictionaries are learned, using the 
K-SVD method, from a public database of liver segmentation challenge 
(MICCAI-SLiver07). The learned dictionaries provide prior knowledge 
to the level set formulation. For the quantitative evaluation, the pro¬ 
posed method is evaluated using the testing data of MICCAI-SLiver07 
database. The results are evaluated using different metric scores com¬ 
puted by the challenge organizers. The experimental results demonstrate 
the superiority of the proposed framework by achieving the highest seg¬ 
mentation accuracy (79.6%) in comparison to the state-of-the-art meth¬ 
ods. 

1 Introduction 

The liver is among the most common human organs to undergo invasive surgeries. 
In the case of liver tumors, an accurate 3D liver segmentation is important 
because a resection of the liver has to be carefully planned in order to preserve as 
much of the liver as possible [1]. Usually, computer tomography (CT) images are 
acquired for these purposes. The challenging aspects of liver segmentation in CT 
scan images can be summarized as follows: Firstly, the overlapping boundaries 
between the liver and surrounding organs such as heart, stomach, right kidney, 
and spleen, as illustrated in Fig. 1. Secondly, the large variability in liver shape, 
intensity distributions, and geometric properties from patient to patient, which 
make it difficult to describe the liver with model-based approaches [1]. Finally, 
liver segmentation, using a slice-by-slice approach, in 2D space is time consuming 
and gives inaccurate results. Therefore, for accurate segmentation results, volume 
segmentation methods in 3D space are more efficient [1]. 

In recent years, a variety of methods have been proposed to segment the liver. 
In 2007, liver segmentation competitions from CT data were held in conjunction 
with MICCAI [2]. Between 2007-2014, more than 35 automatic methods were 
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Fig. 1. Overlapped boundaries in CT images of MICCAI-SLiver07 training data [2]. 
The red ellipses in the first row show the overlap of the liver-heart, liver-stomach, 
and liver-kidney boundaries respectively. The second row shows the liver ground truth 
segmentation. 


proposed and evaluated by the MICCAI-SLiverOT challenge organizers [2]. Most 
of them achieved segmentation accuracies between 60% - 77%. The best auto¬ 
matic method cited on the MICCAI-SLiver07 challenge [2] achieves 77.3% [3]. 
This method proposed a statistical deformable model with an intensity distribu¬ 
tion model and modified the model segmentation results by a locally constrained 
free form deformation. Wimmer et al. [4] propose a probabilistic active shape 
model (ASM) using a level set method based on Parzen density estimation. This 
method is better than the ASM because it is able to capture more statistical 
information than the ASM. Linguraru et al. [5] propose a generic affine invari¬ 
ant shape parametrization using a graph-cut method for liver and liver tumor 
segmentation. 

Moreover, there are many liver segmentation methods that were not involve 
in MICCAI-SLiver07 competition. Ling et al. [6] propose a hierarchical learning 
method to build a liver shape model. Wang et al. [7] propose a method to built a 
sparse representation of liver shape. This method needs a large training data to 
perfectly represent the shape prior. Zhang et al. [8] propose a voxel-wise-based 
segmentation method. This method considers only local image information to 
create the sparse representation of liver shape prior. Huang et al. [9] propose 
an automatic method for liver segmentation. This method is based on proba¬ 
bilistic atlas. Tomoshige et al. [10] propose a conditional statistical shape model 
(SSM) using non-contrast CT images. Huang et al. [11] propose an ASM based 
automatic liver segmentation method. This method is based on the ASM and 
boundary profile classification with free-form deformation. 

Many approaches have shown that variational formulation is the most ef¬ 
fective method to solve image segmentation problems but it needs well defined 
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region boundaries or prior knowledge [12]. In recent years, voxel-wise-based dic¬ 
tionary learning and sparse coding methods have been popular and useful tools 
in signal processing and machine learning. In image segmentation, voxel-wise- 
based approaches, although they can be useful, achieve limited success because 
of the non-linear distribution of the image data such as the overlaps between re¬ 
gion boundaries in medical images [13]. Therefore, a perfect segmentation needs 
a non-linear separation of the image data. One useful solution to this problem 
is the mapping of the image data from the data space to the feature space. Such 
a mapping allows for a linear separation of the overlapping regions. 

To solve the above mentioned limitations of the variational formulation and 
voxel-wise dictionary learning methods, in this paper, we present a novel fully 
automatic liver segmentation framework via a level set formulation using dictio¬ 
nary learning and sparse coding with shape prior. The proposed level set energy 
equation consists of data and regularization terms. These terms have been inte¬ 
grated into the implicit framework in a novel fashion. The data term consists of 
the sparse representation of global image features, and the shape prior, which 
represents the voxel-wise sparse representation (local image information) of liver 
shapes. The regularization term is modified to include the shape prior. The main 
contribution of this paper is the embedding of the global and local sparse rep¬ 
resentation of image information into the data term to integrate a novel cost 
function of the image data. Two dictionaries are built: one using global image 
features (texture, CT number, and volume properties) and one using voxel-wise- 
based learning. The remaining sections are organized as follows. The proposed 
framework is described in detail in Sec. 2. In Sec. 3, experimental results are 
presented and discussed. Finally, this work is concluded in Sec. 4. 


2 Method 

2.1 Feature Extraction 

Sets of features are considered to learn the dictionaries: texture, volume prop¬ 
erties, and Hounsfield scale (HU). Hounsfield scale (HU) or called CT-number 
of the liver varies between (-1-40 — \-60HU). A set of texture features is consid¬ 
ered in our method (entropy, energy, contrast, homogeneity, sum mean, variance 
correlation, maximum probability, inverse difference moment, cluster tendency). 
These features have been computed using the gray level co-occurrence matrix 
(GLCM) for four different offsets (0°, 45°, 90°, and 135°) [15]. Volume prop¬ 
erties are important and they refer to the topological features of the liver. We 
use the volume, surface area, Euler number, major axis length, and minor axis 
length [16]. The total number of features used, in our method, is 42 features. 


2.2 Dictionary Learning 

In our framework, MICCAI-SLiverO? training data [2] (20 patients CT scan 
image data) are used for learning. Using the ground truth segmentation, each 
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image data of the training set is considered as two labels (liver/non-liver). A 
dictionary is built for each label. Let c = 1,2 be a number of the labels in the 
image data, are the training set of each label. The goal is to build a dictionary 
matrix, for each label, that has a perfect representation of Nc- 

Let D(. be a dictionary nx zx Kc matrix = {di, d 2 , ■■■, cIk^), which consists 
of Kc atoms (columns), {di G : i = 1,2, ...,Kc} and each atom represents 

the key features of Yc, where {Kc Nc). Yc = {yi,y 2 , is a n x z x Nc 

matrix which consists of feature matrices {yi G : i = 1, 2,..., Nc} of Nc 

data samples with dimension nx z. n = 42 is the number of features used to train 
the dictionaries, and z = 160. The sparse representation Ac = (ai, 02 ,..., oat^) G 
j^K^xNaXz jg computed s.t. yi = Dctti and ||ai||o << Kc,i = 1, ■■■,Nc. In such a 
way that each feature matrix in Yc is represented by linear combination of a few 
atoms in the dictionary according to the non-zero elements in Ac- The problem 
can be formulated as the following minimization: 

arg min \\Yc - TIc^cIIf s.t. VI < i < Nc, ||ai||o < Kc (1) 

Dq ,-<4c 

To solve Eq. (1), we propose the use of K-SVD method [14]. This method is 
robust to solve the problem in Eq. (1) by iterating Kc times of singular value 
decomposition (SVD) for the dictionary update step. Eor sparse coding step, we 
use the orthogonal matching pursuit (OMP). 


2.3 The Proposed Level-Set Framework 

The proposed energy equation is based on the piece-wise constant of the Mumford- 
Shah variational model [12]: 

F{M,B) = / {I -M)^dx 
Jr 

' -V-^ 

data term 

where M : R —>■ TZ'^ is a constant approximation of the observed image I. r{B) 
is the length of the boundary of M. 

Due to the difficulties illustrated in Eig. 1, in this paper, we consider two 
prior knowledge to formulate the proposed level-set formulation: the global image 
features, and the local image information, which represents the shape prior. 

We consider two steps for automatic initial liver localization in the target 
image. The first step is thresholding the target image. This step is based on that 
the gray level range of the liver is between 125-155. Second, the CT number of 
the liver is between (-1-40 — \-QQHU). The second step is based on the criteria 
that the liver is the biggest organ in the abdominal cavity and it lays on the 
right side. In this step, the image data (after thresholding) is divided into left and 
right side images. The centroid of the segmented volume in the right side image 
is computed. Then, from the centroid, 16 x 16 x 16 bounding box is computed 
to represent the initial level set. 


+ ( 2 ) 
regularization term 
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Global (Region-based) Image Features In this subsection, we describe the 
proposed method using the learned dictionary with the level set formulation. 
Starting from the Mumford variational model [12], the data term — M)^) 
in Eq. (2) can be considered as a k-means clustering problem if the regularization 
term is ignored. Therefore, this term can be reformulated as £2 norm, assuming 
that the £2 is a generalization of k-means clustering problem: 

E^iA,) = P(min || Vf - D,A, H^) + A|6| (3) 

-^c 


P(/) = 


> 0 if leVf 

< 0 if z e 1 //°™ 


H{P) = 


1 if P > 0 
0 if P < 0 


( 4 ) 

( 5 ) 


where E is the energy function. Vf is a matrix of feature samples of the liver, with 
the size n x z, in the target image. A > 0 is the regularization parameter and b is 
the contour length. I is the label of voxels in Vf , and Vf°"^ is the complement of 
Vf. P{1) represents the descriptor of the liver volume in the target image. H{P) 
is a Heaviside function. 


Local (voxel-wise) Image Information (Shape Prior) Since the gray- 
values in all organs are highly similar, the global image information is infeasible 
to achieve perfect result. Therefore, we add the local image information as prior 
knowledge. The local image information is represented by the shape prior, which 
represents the sparse representation (voxel-wise) of the liver shape. 

Using the ground truth segmentation of the training data [2], we build the 
voxel-wise sparse representation of N liver shapes. From the image data, patches 
are extracted. Each patch, with a size 16xl6xz, is concatenated in a matrix with 
size Us X z, where = 256 and z = 160. Thus, = [di,d 2 , ■. ■, dicj € jin^,z,Ks ^ 
{di G : i = 1,2, ...,Ks}, and Kg is the total number of columns in Dg. 

In such a way that each sample matrix in G i = 1,2, is 

represented by linear combination of a few atoms in the dictionary Eg according 
to the non-zero elements in the sparse representation a. a = (ai, 02 , ■■■, ajv) G 
rKs^n -g computed s.t. Yg. = DgUi and ||Q;i||o << Kg,i = 

Let J be a test image data and let Vt be an initial liver volume in I, i.e. 
initially detected VOI as explained in the beginning of Section 2.3. To find 
the desired shape, the patches are extracted and concatenated in a matrix as 
explained above. Then, we solve the following minimization problem: 

min II a llo s.t. || U* - L>sQ; || 2 < e (6) 

One possible solution of Eq. 6 is the replacement of the £q norm with £i norm [17] : 

Eghape{a) = min || Vt - DgU II 2 +A || a ||i (7) 

Oi 
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In Eq. (7), the first term is the data term. The second term is the regularization 
term of the shape prior. If the solution of Eq. (7) exists, the desired shape is 
obtained, i.e. || a ||i should has only one non-zero entrance which represents the 
desired shape prior. 

Combining Eq.s ( 3) and (7), the overall proposed energy equation can be 
formulated as follows: 


Etotai{Ae,a)= ^ + A(|6|+ II a ||i) (8) 

Foreground Background Regularization 

where G is: 

G = {uAxi\\D,A,-Vf\\l){uAxi\\D,a-Vt\\l), = 1 - G (9) 

Ac a 

Level-Set Optimization This subsection presents the optimization of the pro¬ 
posed energy function (Eq. (8)). The main task, to get this minimization, is by 
hnding the best matching between the sparse representation of the target image 
data with learned dictionaries and Dg- 

Once the norm is applied to the constraint term, Eq. (8) become difficult 
to solve. Here, we adopt the concept of iteratively re-weighted (IR) algorithm to 
handle this challenge. We adopt the general idea of IR algorithm by reformulating 
the minimization problem in Eq. (8) to the weighted mean square error (MSE) at 
the iteration, given by: At iteration (t), we consider W* as the weight of the 
volume of interest (VOI) at (t—I) iteration, i.e. Wl = H min^^ || Vf — DcA*~^ 
and W 2 = H mincc || Vt — Dsa*~^ |||: 


Al ^ 

( 10 ) 

a* 

( 11 ) 


In all experiments, A is set to 0.7. 

3 Experimental Results and Discussion 

In this paper, several experiments have been conducted on diverse medical im¬ 
ages to explore the advantages of the proposed method. We use a public CT-scan 
medical image database, namely, the segmentation of the liver 2007 (MICCAI- 
SLiver07) database [2] (20 patients training data and 10 patients testing data). 
The MICCAI-SLiver07 database is publicly available through the MICCAI 2007 
Segmentation of the Liver challenge [2] . This image database provides the ground 
truth segmentation of the training data (manual expert annotations), and have 
a slice resolution of 512x512 voxel and 64 to 502 slices. The inter-slice space 
varies between 0.7 mm and 5 mm and the intra-slice space varies between 0.56 
mmxO.56 mm and 0.86 mmxO.86 mm. 

Figure 2 shows three examples of our results of the MICCAI-SLiver07 testing 
data [2] . In each example, the first column is the original image data. The second 
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Fig. 2. Three liver segmentation examples using the MICCAI-SLiver07 testing data [2] . 
In each example, the first column shows the original image data in coronal, sagittal, and 
axial planes. The second column represents the segmentation results of these sections. 
The fourth row shows the 3D liver segmentation of our method. 


column shows the segmentation results. The fourth row represents the 3D liver 
segmentation results. The examples given in Figs. 2 have a different contrast. 
The boundary ambiguity between the liver and surrounding organs is noted in all 
examples. The results show the ability of our method to solve this segmentation 
challenge by considering the global and local image information in the level set 
formulation. 

The performance of the proposed method has been evaluated using five met¬ 
rics proposed by the MICCAI-SLiverO? challenge organizers [2]. First, the overlap 
error is defined as VOE= 100(1 — {\H n T\/\H U T|)) and is given in percent. 
Second, the relative volume difference is defined as VD= 100(|iJ| — |T|/|T|) 
and is given in percent. It gives an indication whether the results are over- or 
under-segmentation. Third, the average symmetric surface Distance (AvgD) is 
the average distance of all surface voxels distances between each surface voxel 
in H and the closest surface voxel in T. Forth, the root mean square symmet¬ 
ric surface distance (RMSD) is based on the surface distance and is given in 
millimeters. Fifth, the maximum symmetric surface distance (MaxD) represents 
the maximum differences between both sets of surface voxels in H and T and 
is given in millimeters. It also known as the Hausdorff distance. Assuming that 
H and T are two sets of voxels of the segmentation results and ground truth 
segmentation respectively. 

The results of the testing data were sent to the MICCAI-SLiver07 chal¬ 
lenge organizers and the evaluation is obtained by the organizers according to 
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Table 1. Our average results of MICCAI-SLiver07 testing data [2] using evaluation 
metrics [2], The maximum scores (Scr) is 100. The results for each metric report the 
mean over all images, together with mean scores. All scores are averaged to a final 
score over all images. Note: This results are obtained by MICCAI-SLiver07 challenge 
organizers and they are cited on http : //sliverO? . org/showresult .php?rank= 
12&submission=2014-09-20-2136. 


Method 

VOE 

[%] 

Scr 

VD 

[%] 

Scr 

AvgD 

[mm] 

Scr 

RMSD 

[mm] 

Scr 

MaxD 

[mm] 

Scr 

Total 

Scr 

1 

6.74 

73.7 

2.45 

87.0 

0.94 

76.4 

1.48 

79.5 

14.98 

80.3 

79.4 

2 

7.49 

70.7 

3.35 

82.2 

1.10 

72.5 

2.18 

69.7 

25.61 

66.3 

72.3 

3 

5.30 

79.3 

0.92 

95.1 

1.01 

74.7 

1.57 

78.2 

22.21 

70.8 

79.6 

4 

6.32 

75.3 

-0.81 

95.7 

0.92 

76.9 

1.55 

78.4 

13.98 

81.6 

81.6 

5 

6.20 

75.8 

1.62 

91.4 

1.02 

74.5 

1.90 

73.7 

19.40 

74.5 

78.0 

6 

6.55 

74.4 

-0.15 

99.2 

0.99 

75.3 

1.51 

79.0 

13.14 

82.7 

82.1 

7 

6.30 

75.4 

3.70 

80.3 

0.89 

77.8 

1.33 

81.6 

10.34 

86.4 

80.3 

8 

6.17 

75.9 

3.39 

82.0 

0.97 

75.8 

1.51 

79.0 

11.81 

84.5 

79.4 

9 

7.53 

70.6 

1.92 

89.8 

0.88 

78.0 

1.36 

81.2 

17.09 

77.5 

79.4 

10 

5.78 

77.4 

-1.05 

94.4 

0.81 

79.8 

1.40 

80.6 

10.61 

86.0 

83.7 

Avg. 

6.44 

74.9 

1.53 

89.7 

0.95 

76.3 

1.58 

78.1 

15.92 

79.1 

79.6 


Table 2. Our results compared to the methods [.’5,4,5]. The maximum scores (Scr) is 
100%. The results for each metric report the mean over all testing data, with mean 
scores. 


Alg. 

VOE 

[%] 

Scr 

[%] 

AvgD 

[mm] 

Scr 

[%] 

RMSD 

[mm] 

Scr 

[%] 

# Vol. 
Tested 

# Train. 
Vol. 

Average 
Scr [%] 

Ours 

6.44 

74.9 

0.95 

76.3 

1.58 

78.1 

10 

20 

79.6 

Alg. [3] 

6.09 

76.2 

0.95 

76.3 

1.87 

74.0 

10 

112 

77.3 

Alg. [4] 

6.47 

74.7 

1.02 

74.5 

2.00 

72.3 

10 

20 

76.8 

Alg. [5] 

6.37 

75.1 

1.00 

74.9 

1.92 

73.4 

10 

92 

76.2 


the conditions of the challenge [2]. Table 1 shows the results of 10 patients of 
SLiverO? testing data [2]. Table 1 illustrates five different metrics (VOE, VD, 
AvgD, RMSD, and MaxD) with the average value (Avg) of all of these metrics. 

The proposed method achieves the highest rank (segmentation accuracy) 
among 35 automatic methods submitted in [20] . The results of our method have 
also been published online [20]. In MICCAI-SLiver07 challenge, some automatic 
methods, such as the methods presented in [3,5], used large training data and 
obtained their results using the training data of 112 and 92 training image re¬ 
spectively instead of the 20 patients. 

The evaluation of the MICCAI-SLiverO? challenge organizers [2] has become 
a reference for liver segmentation enabling efficient and precise comparisons [2]. 
Thus, we compare the proposed method to the best automatic methods [3,4,5], 
which were evaluated by challenge organizers, as shown in Table 2. 
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4 Conclusion 

This paper presents a fully automatic 3D liver segmentation method based on 
level set formulation using dictionary learning with shape prior. We integrate the 
data and regularization terms of the level set energy equation in a novel fashion. 
Two dictionaries are used in our framework: the feature dictionary, which repre¬ 
sents the global image features, and the voxel-wise dictionary, which represents 
the shape prior (local image information). The proposed method has been tested 
using MICCAI-SLiverO? database [2]. The evaluation results of the MICCAI- 
SLiverO? testing data are computed by the MICCAI-SLiverO? challenge organiz¬ 
ers. The experimental results show that our method achieves the best segmen¬ 
tation accuracy (79.6%) among more than 35 automatic segmentation methods 
cited on MICCAI-SLiverO? challenge website [20] . 
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